|
|
Accession Number |
TCMCG075C04479 |
gbkey |
CDS |
Protein Id |
XP_017978689.1 |
Location |
complement(join(37104311..37104622,37105522..37105609,37105691..37105797,37105887..37105948,37106142..37106199,37106283..37107158,37107476..37107830,37107984..37108084,37108920..37109054)) |
Gene |
LOC18614643 |
GeneID |
18614643 |
Organism |
Theobroma cacao |
|
|
Length |
697aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018123200.1
|
Definition |
PREDICTED: nucleolin 1 [Theobroma cacao] |
CDS: ATGGGCAGCGTGGATCGAGTTGATGACTTGACATTTAAGGTTAATTTTAGCGGAGATGGAGCAGCCAAGTTGAGAGAGAGAGTCAAGGATAAACTCAAGGAGTTTATGGGCGACTACACTGATGACACTCTTGTGGAATATGTGATTGTCTTATTGAGAAATGGTAGACGTAAAGATGAAGCAAGGAATGAGTTGAATGTTTTTCTTGGGGATGACAGCGATTCTTTTGTCTCCTGGTTGTGGGATCATCTGGCTTCAAATTTGGATCTATATGTTCCTTCTCAGGAACCTCATGGGGAAGAAGCAGCCAAAACAAGATGCATACTGGGGAACCAGCTGGCTGGTGCTGATGCTCAATTGGATTCTGATTCTGAAAGAGGAAAGTCTACTAAGTTGGCTAGGAACCGGCACAATAGGGAGTGGAAGGGCCTAGTTCAGGATGCTTCTGAACCACCTCCTCTTCGGAGCTCTGAGGTTGAGAATATTCGTTTTGAGGAAAAAACTCGTCGAAAAGTAAGCCGTGGAAGATCTTCCTCCCCTCGTCCTAGTCAGAAGAAAAGGAGCAGAATTGATGACCGACAACCTATAAAGAGGGAGGAAGTTTCTCAGATGACCATAGATGCCCCTAGGCGGCTGCTTCAGTTTGCAGTGCGAGATGCAGTTGGAACTTCAAGGTCACCTATTTCAGCAAAAGAGTCCTCATTCAAGCGCCTTCGGTCTGTGGTGTCAACATCTTCTGGAGATTCATCACTACCTGATCGTCCTAGGAGAATTCGGTCTGTTGCGAGAGTGCCAAATCTGATGACAACAATGCTTAAAGCTGTGGCAGAAGCTGCTGAAGATGTGGCAAAGGTCAAAACTGCTGGAAGTGTGTTTGACAGGCTTGGTCCTGGAATGGATGTTTTGAAGACCCATGACCGACATGCGGCATATAGAGAATCTCTTGCTGAGGATGAAGAGTATGGAGATCTTAAGCAACCACTGGAGAATATCCAATCAGCATATCTTCAAAGAAATGAGTATGCGGGACAACATGTCGGTAACATGACAGCATTAGAAAGTCAGACTGTGTTGGCTTTAGATTCTCTGTCTGACAATGAAGCATATGATGATGTTAATGTTGCTGGCCATGGAGTTATGGATGTGTCTCAGACTGGTACATCTAGTGGAAACAAGGGTGATAACTCACTTGTGGTGCAATACAGTGTAGCTAGGCATGATGAACTCATGCAAAGAACAAGGAACAAAGACCACAATCAATCTACTGCAGCAGCAAATACTTCTCGTAAGATAGTCAACAGTTCTGTCAATGTAAATACTTGGAAACCACCTCATTATCAAGAGCCAAGGGAGGTTTCAGAATTTGGCAGTCAAAGTTCTCTTCAGGAGATTGAAGCAGTCGCTAGCAAATCTAATCTTAGATTGATGAAGGAGAATGGCAACCCTGTTACTGTTGGTAATGGAAATGTAAAAAGTGCTGGTGATATTCAAGAAATGCCTCAGAAGACAGTGCAGTCTTTTTCTGTTCCCTATGCTGCTGCACGACCTTTAGAGGATGCTGATTCTCGGACGATCTTTGTAAGCAATGTTCATTTTGCTGCTACCAAGGACAGTCTTTCTCGGCATTTTAATAAGTTTGGGGAAGTGCTAAAAGTTGTTATAGTTACAGATGCGGCGACAGGGCAACCAAAAGGGTCAGCTTATGTGGAGTTTATGCGTAAGGAAGCAGCAGACAATGCTTTATCCCTTGATGGTACCTCTTTCATGTCACGGATTCTTAAGGTTGTGAAGAGAAGCTCTGCTCATCAAGAAGCTGCTCCTGTCATGACATGGCCCCGCATTGCCCGAGGCTCTCCATTTGCTGCTGCAAGGTTTGCTAGAGCCCCTTTTCCCAGAGGTATCCCTGGTGCATATAGGCCTCGCCTTCCTTTTAAGCCTGGTGCCAGAAGCTTGCAGTGGAAGCGTGATGCTCAGGCCACTCCAGCCGATGCTGCTGCTTCGTTCACTGGGAACAGTGTTTTTTCTCCCACTGCCCGCAGTCTCACCTATGTCCGAACAGAGCCTAAATCGGAGGGGAATGCCAGTAGCTCCTAG |
Protein: MGSVDRVDDLTFKVNFSGDGAAKLRERVKDKLKEFMGDYTDDTLVEYVIVLLRNGRRKDEARNELNVFLGDDSDSFVSWLWDHLASNLDLYVPSQEPHGEEAAKTRCILGNQLAGADAQLDSDSERGKSTKLARNRHNREWKGLVQDASEPPPLRSSEVENIRFEEKTRRKVSRGRSSSPRPSQKKRSRIDDRQPIKREEVSQMTIDAPRRLLQFAVRDAVGTSRSPISAKESSFKRLRSVVSTSSGDSSLPDRPRRIRSVARVPNLMTTMLKAVAEAAEDVAKVKTAGSVFDRLGPGMDVLKTHDRHAAYRESLAEDEEYGDLKQPLENIQSAYLQRNEYAGQHVGNMTALESQTVLALDSLSDNEAYDDVNVAGHGVMDVSQTGTSSGNKGDNSLVVQYSVARHDELMQRTRNKDHNQSTAAANTSRKIVNSSVNVNTWKPPHYQEPREVSEFGSQSSLQEIEAVASKSNLRLMKENGNPVTVGNGNVKSAGDIQEMPQKTVQSFSVPYAAARPLEDADSRTIFVSNVHFAATKDSLSRHFNKFGEVLKVVIVTDAATGQPKGSAYVEFMRKEAADNALSLDGTSFMSRILKVVKRSSAHQEAAPVMTWPRIARGSPFAAARFARAPFPRGIPGAYRPRLPFKPGARSLQWKRDAQATPADAAASFTGNSVFSPTARSLTYVRTEPKSEGNASSS |